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GenCore version 6.2.1 
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM protein - protein search, using sw model 



June 24, 2008 



Search time 26 7 Seconds 
(without alignments) 
1751.538 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-552-515-l_COPY_15 7_933 
4123 

1 QQDVQDGNTTVHYALLSASW SELSSHWTPFTVPKASQLQQ 777 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 3405708 seqs, 601879884 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 45 summaries 



Database : A_Geneseq_200711 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000 : * 

4: geneseqp2001 : * 

5: geneseqp2002 : * 

6: geneseqp2003a : * 

7: geneseqp2003b : * 
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8 : 


geneseqp2004a 


* 


9: 


geneseqp2004b 


* 


10: 


geneseqp2005 


* 


11: 


geneseqp2006 




12: 


geneseqp2007 





Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

Result Query 

No. Score Match Length DB ID Description 



1 


4123 
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ADT77664 


Adt77664 Splice va 
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4123 


100 


0 


933 


11 
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10 
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11 


AEG11146 
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11 
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11 
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12 
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35 
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AFB77190 
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13 


1437 


34 
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1003 
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ADG48280 
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34 
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16 
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Afb95185 Fruit fly 
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6 


ADC42854 


Adc42 85 4 REMAP pro 


18 


1369 


33 


2 


910 


11 


AEL84658 


Ael84658 Tumor mar 


19 


1367.5 


33 


2 


712 


11 


AEG11145 


Aeglll45 Human tra 


20 


1344 


32 


6 


1075 


4 


ABB65993 


Abb65993 Drosophil 


21 


1344 


32 


6 


1075 


10 


AFC04729 


Afc04729 Fruit fly 


22 


1159 . 5 


28 


1 


1058 


4 


ABB65022 


Abb65022 Drosophil 


23 


1159.5 


28 


1 


1058 


10 


AFC01816 


Afc01816 Fruit fly 


24 


1154 


28 


0 


596 


6 


ADB64387 


Adb6 438 7 Human pro 


25 


1061 . 5 


25 


7 


594 


4 


AAB92637 


Aab92637 Human pro 
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ALIGNMENTS 

RESULT 1 
ADT77664 

ID ADT77664 standard; protein; 933 AA. 

XX 

AC ADT77664; 

XX 

DT 15-JUN-2007 (revised) 

DT 13-JAN-2005 (first entry) 

XX 

DE Splice variant-novel gene expressed in prostate (SV-NGEP) polypeptide. 
XX 

KW Splice variant-novel gene expressed in prostate; SV-NGEP; human; 
KW prostate cancer; cytostatic; gene therapy; immunotherapy; BOND_PC; 
KW NGEP long variant; NGEP long variant [Homo sapiens]; G05886. 
XX 

OS Homo sapiens . 



XX 

FH Key Location/Qualifiers 

FT Domain 1. .345 

FT /label= Cytoplasmic 

FT Region 157. .933 

FT /note= "An immunogenic fragment comprising 8 consecutive 

FT amino acids that specifically binds to an antibody that 

FT specifixally binds to a polypeptide comprising amino 

FT acids 157-933 is referred to in Claim 1" 

FT Region 170. .178 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Region 215. .223 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Region 258. .266 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Domain 346. .368 

FT /label= Transmembrane 
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FT Domain 369. .421 

FT /label= External 

FT /note= "Cell surface" 

FT Region 403. .411 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Domain 422. .441 

FT /label= Transmembrane 

FT Region 427. .435 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Domain 442. .501 

FT /label= Cytoplasmic 

FT Domain 502. .524 

FT /label= Transmembrane 

FT Domain 525. .543 

FT /label= External 

FT /note= "Cell surface" 

FT Domain 544. .566 

FT /label= Transmembrane 

FT Region 557. .565 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Region 562. .570 

FT /note= "Epitope, predicted to bind HLA2-01" 

FT Domain 567. .586 

FT /label= Cytoplasmic 

FT Domain 587. .609 

FT /label= Transmembrane 

FT Domain 610. .714 

FT /label= External 

FT /note= "Cell surface" 

FT Domain 715. .737 

FT /label= Transmembrane 

FT Domain 738. .761 

FT /label= Cytoplasmic 

FT Domain 762. .784 

FT /label= Transmembrane 

FT Domain 785. .933 

FT /label= External 

FT /note= "Cell surface" 

FT Region 846. .854 

FT /note= "Epitope, predicted to bind HLA2-01" 

XX 



PN WO2004092213-A1 . 

XX 

PD 28-OCT-2004. 

XX 

PF 05-APR-2004; 2004WO-US010588 . 
XX 

PR 08-APR-2003; 2003US-0461399P . 
XX 
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PA (USSR ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Pastan I, Bera TK, Lee B; 
XX 

DR WPI; 2004-758338/74. 

DR N-PSDB; ADT77665. 

DR PC:NCBI; gi48093524. 
XX 

PT New Splice Variant-Novel Gene Expressed in Prostate polypeptide or 

PT encoding nucleic acid molecule for diagnosing, preventing or treating 

PT cancer, especially prostate cancer. 
XX 

PS Claim 1; SEQ ID NO 1; 88pp; English. 

XX 

CC The present sequence is the protein sequence of splice variant-novel gene 

CC expressed in prostate (SV-NGEP) . SV-NGEP is identical to NGEP from amino 

CC acid 1-157, diverging from amino acid 158. Expression analysis in 76 

CC normal and foetal tissues showed SV-NGEP to be strongly expressed only in 

CC a prostate sample. Claimed methods for detecting prostate cancer in a 

CC subject comprise: contacting the sample with an antibody that 

CC specifically binds a SV-NGEP polypeptide and detecting the formation of 

CC an immune complex; or detecting an increase in expression of SV-NGEP 

CC polypeptide or mRNA. Antibodies to an SV-NGEP polypeptide can be used to 

CC detect metastatic prostate cancer cells at locations other than the 

CC prostate. A claimed method for producing an immune response against a 

CC cell expressing SV-NGEP, for example in a subject with prostate cancer, 

CC comprises administering the polypeptide, or a polynucleotide encoding it, 

CC to produce an immune response that decreases growth of the prostate 

CC cancer. A claimed method for inhibiting the growth of a malignant cell 

CC that expresses SV-NGEP comprises culturing cytotoxic T lymphocytes (CTLs) 

CC with SV-NGEP to produce activated CTLs that recognise an NGEP expressing 

CC cell, and contacting the malignant cell with the activated CTLs. 

CC Alternatively, growth of a malignant cell is inhibited by contact with an 

CC antibody that specifically binds an SV-NGEP polypeptide, where the 

CC antibody is linked to an effector molecule (chemotherapeutic agent or 

CC toxin) that inhibits growth of the malignant cell. This may be performed 

CC in vivo. Kits for detecting an SV-NGEP polypeptide or polynucleotide in a 

CC sample are also claimed. 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 933 AA; 

Query Match 100.0%; Score 4123; DB 8; Length 933; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 777; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 
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I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 15 7 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 216 

Qy 61 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 217 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 2 76 

Qy 121 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 277 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 336 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 33 7 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 3 96 

Qy 241 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I 
Db 397 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 456 

Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLV 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 45 7 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVWMCLV 516 

Qy 361 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 517 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 576 

Qy 421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 577 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 636 

Qy 481 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 5 40 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 63 7 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 6 96 

Qy 541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 00 

I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 697 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 756 

Qy 601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 75 7 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 816 

Qy 661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIP 72 0 

I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 817 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIP 8 76 

Qy 721 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 777 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 877 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933 



RESULT 2 
AEL84788 

ID AEL84788 standard; protein; 933 AA. 
XX 

AC AEL8 4 788; 
XX 

DT 18-OCT-2007 (revised) 

DT 15-JUN-2007 (revised) 

DT 28-DEC-2006 (first entry) 
XX 

DE Tumor marker gene NGEP SEQ ID NO 155. 

XX 

KW cytostatic; diagnosis; prognosis; tumor marker; gene expression; 

KW drug screening; cancer; neoplasm; NGEP; BOND_PC; NGEP long variant; 

KW G05886. 
XX 

OS Homo sapiens . 
XX 

PN WO2006110593-A2 . 
XX 

PD 19-OCT-2006. 

XX 

PF 07-APR-2006; 2006WO-US013172 . 

XX 

PR 07-APR-2005; 2005US-0669342P . 

PR ll-OCT-2005; 2005US-0725982P . 
XX 

PA (MACR-) MACROGENICS INC. 
XX 

PI Von Haller PD, Schummer M, Meyer DW, Schubert LA, Tjoelker LW; 
XX 

DR WPI; 2006-814687/82. 

DR N-PSDB; AEL84787. 

DR REFSEQ; NP_0 0 1 0 0 1 8 9 1 . 

DR PC:NCBI; gi48093524. 
XX 

PT Detecting or diagnosing cancer in a subject comprises determining 

PT expression of at least one gene, and comparing level of expression to a 

PT control sample from a normal subject, where increased expression level 

PT indicates cancer. 
XX 

PS Claim 8; SEQ ID NO 155; 583pp; English. 
XX 

CC The invention describes a method of detecting or diagnosing cancer in a 

CC subject comprising determining the expression level of at least one gene, 

CC and comparing the level of expression to a corresponding control sample 
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CC from a normal subject, where cancer is detected or diagnosed if there is 

CC an increase in the expression level of the gene relative to the 

CC expression in the control sample. Also described are: identifying a 

CC compound to be tested for its ability to prevent, treat, manage, or 

CC ameliorate cancer or its symptom; a compound identified by the method; 

CC treating cancer in a patient; treating a cancer in a subject that is 

CC fully or partially refractory to a first treatment in a patient; and a 

CC pharmaceutical composition comprising an amount of an antibody selected 

CC from anti-SLC12A2, anti-FLJ23375, anti-GRM5, anti-TAS2Rl, anti-NRXN2, 

CC anti-C14orf 160, anti-MGC 15668, anti-MGC33486 , anti-TMEM16F, anti-FAT, 

CC anti-KIAA0195, anti-LRFN, anti-NFASC, anti-BAT2Dl, anti-MGC2963, anti- 

CC KIAA0685, anti-EDGS, anti-GGTLS, anti-PLVAP, anti-FLJ31528 , anti- 

CC FLJ90709, anti-VEZATIN, anti-TMPRSS9 , anti-ATP13A5 , anti-PKHDlLl , anti- 

CC C2orfl8, anti-ANKRD22, anti-FAM62B, anti-LOC57168, anti-CDKALl, anti- 

CC SLC39A3vl, anti-SLC39A3v2, anti-BAT5, anti-TM9SF4, anti-DC2, anti-VAPB, 

CC anti-XTP3TPB, anti-TACSTD2 , anti-FNDC3A, anti-GKOOl, anti-0CIAD2, anti- 

CC PR01855, anti-C20orf3, anti-SDFRl, anti-FLJ20481, anti-LENG4, anti- 

CC FLJ12443, anti-ARP5 Long, anti-ARP5 Short, anti-TMD06 45 , anti-NGEP, anti- 

CC ILIRAPI, anti-PLXNBl, anti-ATP2B2, anti~FLJ11848, anti-ENTPD2, anti- 

CC PPMIH, anti-KRTKAP3, anti-KCNC3, anti-TM9SFl, anti-ULBPl, ant i-Cl 9orf 26 , 

CC anti-KIAA830, anti-KIAA1244, anti-KIAAl 797, anti-MGC26856, anti-NET02, 

CC anti-SUSD2, anti-F0LR2, anti-EMR2, ENTPDl, anti-ATPlOB, anti-PTK7, anti- 

CC FLJ14681, anti-C20orf 22, anti-FLJ14281, anti-FAM8Al, anti-TMED7, anti- 

CC C20orfl08, anti-ATADl, anti-GPR154, anti-C14orf 27, anti-OSAP, anti- 

CC FAD104, anti-FLJ90492, anti-SLC27A3, anti-RON, anti-ATP13Al , anti- 

CC DKFZP564D166, anti-ESSPL, anti-EXTL3, anti-KAIl, ant i-KIAAO 96 0 , anti- 

CC MTRNL, anti-SLC27Al, anti-GRIA, anti-0R4Ml, anti~KIAA16 79, or anti-UPK-lb 

CC antibody, and a pharmaceutical carrier. The methods are useful for 

CC detecting, diagnosing, and treating cancer, e.g. colon, lung, ovary, 

CC prostate, pancreas, or bladder cancer. This is the amino acid sequence of 

CC NGEP, altered levels of expression are useful in the diagnosis or 

CC prognosis of cancer. 

CC 

CC Revised record issued on 18-OCT-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 933 AA; 



Query Match 100.0%; Score 4123; DB 11; Length 933; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 777; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 15 7 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 216 

Qy 61 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 217 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 2 76 
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Qy 121 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 277 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 336 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 33 7 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 3 96 

Qy 241 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 39 7 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 456 

Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLV 36 0 

I I I I I I I I I I I I I I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 45 7 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLV 516 

Qy 361 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 517 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 576 

Qy 421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 577 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 636 

Qy 481 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 63 7 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 6 96 

Qy 541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 697 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 756 

Qy 601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 757 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 816 

Qy 661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIP 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 817 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIP 8 76 

Qy 721 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 777 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I 
Db 877 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 933 



RESULT 3 
AEB13426 

ID AEB13426 standard; protein; 885 AA. 
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XX 

AC AEB13426; 

XX 

DT 22-SEP-2005 (first entry) 
XX 

DE Human prostate specific polypeptide #2. 
XX 

KW Screening; diagnosis; drug delivery; prostate specific polypeptide; 

KW cancer; prostate tumor; cytostatic; neoplasm. 

XX 

OS Homo sapiens. 
XX 

PN WO2005062788-A2 . 

XX 

PD 14-JUL-2005. 
XX 

PF 16-DEC-2004; 2004WO-US042406 . 
XX 

PR 22-DEC-2003; 2003US-0531809P . 
XX 

PA (AVAL-) AVALON PHARM INC. 
XX 

PI Weigle B, Ebner R; 
XX 

DR WPI; 2005-497793/50. 

DR N-PSDB; AEB13425. 

XX 

PT Novel isolated prostate specific polypeptide, useful for treating cancer, 

PT and identifying agent that modulates activity of cancer related gene. 
XX 

PS Claim 12; SEQ ID NO 5; 59pp; English. 

XX 

CC The invention relates to an isolated prostate specific polypeptide 

CC comprising one or more immunogenic fragments. The invention also relates 

CC to a method of identifying an agent that modulates the activity of a 

CC cancer related gene involving contacting a compound with a cell 

CC containing a gene under conditions promoting the expression of the gene, 

CC detecting a difference in expression of the gene relative to when the 

CC compound is not present and identifying an agent that modulates the 

CC activity of a cancer related gene, a method of identifying an anti- 

CC neoplastic agent involving contacting a cell exhibiting neoplastic 

CC activity with a compound first identified as a cancer related gene 

CC modulator using and determining a decrease in neoplastic activity after 

CC contacting, when compared to when the contacting does not occur, or 

CC administering an agent first identified to an animal exhibiting a cancer 

CC condition and detecting a decrease in cancerous condition, a method of 

CC determining the cancerous status of a cell involving determining an 

CC increase in the level of expression in a cell of a gene where an elevated 

CC expression relative to a known non-cancerous cell indicates a cancerous 
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CC state or potentially cancerous state, an antibody that reacts with a 

CC prostate specific polypeptide, an immunocon jugate comprising the antibody 

CC and a cytotoxic agent, a method of treating cancer involving contacting a 

CC cancerous cell in vivo with an agent having activity against a prostate 

CC specific polypeptide and an immunogenic composition the prostate specific 

CC polypeptide. The prostate specific polypeptide is useful for identifying 

CC an agent that modulates the activity of a cancer related gene. The 

CC immunogenic composition is useful for treating cancer, preferably 

CC prostate cancer in an animal, e.g. human, which involves administering 

CC the immunogenic composition that is sufficient to elicit the production 

CC of cytotoxic T lymphocytes specific for the prostate specific 

CC polypeptide. The invention is useful for identifying anti-neoplastic 

CC agents. This sequence represents a human prostate specific polypeptide of 

CC the invention. 

XX 

SQ Sequence 885 AA; 

Query Match 90.7%; Score 3 739; DB 10; Length 885; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 702; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 158 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 217 

Qy 61 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 218 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 2 77 

Qy 121 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 278 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 33 7 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 33 8 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 39 7 

Qy 241 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 398 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 45 7 

Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLV 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I 
Db 458 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVWMCLV 517 

Qy 361 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 518 SIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 577 
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Qy 421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 578 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 63 7 

Qy 481 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 540 

I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 638 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 697 

Qy 541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 00 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 698 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 75 7 

Qy 601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 758 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 817 

Qy 661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFE 702 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 818 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFE 859 



RESULT 4 
AEB13424 

ID AEB13424 standard; protein; 843 AA. 

XX 

AC AEB13 42 4; 
XX 

DT 22-SEP-2005 (first entry) 
XX 

DE Human prostate specific polypeptide #1. 

XX 

KW Screening; diagnosis; drug delivery; prostate specific polypeptide; 

KW cancer; prostate tumor; cytostatic; neoplasm. 

XX 

OS Homo sapiens . 

XX 

PN WO2005062788-A2 . 
XX 

PD 14-JUL-2005. 

XX 

PF 16-DEC-2004; 2004WO-US042406 . 
XX 

PR 22-DEC-2003; 2003US-0531809P . 
XX 

PA (AVAL-) AVALON PHARM INC. 
XX 

PI Weigle B, Ebner R; 

XX 

DR WPI; 2005-497793/50. 
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DR N-PSDB; AEB13423. 
XX 

PT Novel isolated prostate specific polypeptide, useful for treating cancer, 

FT and identifying agent that modulates activity of cancer related gene. 

XX 

PS Claim 12; SEQ ID NO 3; 59pp; English. 
XX 

CC The invention relates to an isolated prostate specific polypeptide 

CC comprising one or more immunogenic fragments. The invention also relates 

CC to a method of identifying an agent that modulates the activity of a 

CC cancer related gene involving contacting a compound with a cell 

CC containing a gene under conditions promoting the expression of the gene, 

CC detecting a difference in expression of the gene relative to when the 

CC compound is not present and identifying an agent that modulates the 

CC activity of a cancer related gene, a method of identifying an anti- 

CC neoplastic agent involving contacting a cell exhibiting neoplastic 

CC activity with a compound first identified as a cancer related gene 

CC modulator using and determining a decrease in neoplastic activity after 

CC contacting, when compared to when the contacting does not occur, or 

CC administering an agent first identified to an animal exhibiting a cancer 

CC condition and detecting a decrease in cancerous condition, a method of 

CC determining the cancerous status of a cell involving determining an 

CC increase in the level of expression in a cell of a gene where an elevated 

CC expression relative to a known non-cancerous cell indicates a cancerous 

CC state or potentially cancerous state, an antibody that reacts with a 

CC prostate specific polypeptide, an immunocon jugate comprising the antibody 

CC and a cytotoxic agent, a method of treating cancer involving contacting a 

CC cancerous cell in vivo with an agent having activity against a prostate 

CC specific polypeptide and an immunogenic composition the prostate specific 

CC polypeptide. The prostate specific polypeptide is useful for identifying 

CC an agent that modulates the activity of a cancer related gene. The 

CC immunogenic composition is useful for treating cancer, preferably 

CC prostate cancer in an animal, e.g. human, which involves administering 

CC the immunogenic composition that is sufficient to elicit the production 

CC of cytotoxic T lymphocytes specific for the prostate specific 

CC polypeptide. The invention is useful for identifying anti-neoplastic 

CC agents. This sequence represents a human prostate specific polypeptide of 

CC the invention. 

XX 

SQ Sequence 843 AA; 

Query Match 86.6%; Score 3572; DB 10; Length 843; 

Best Local Similarity 100.0%; Fred. No. 0; 

Matches 6 71; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 158 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 217 
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Qy 61 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 218 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 2 77 

Qy 121 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I I I M I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 278 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 33 7 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I 1! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 33 8 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 39 7 

Qy 241 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 398 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 45 7 

Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLV 36 0 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 45 8 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAWVMCLV 517 

Qy 361 SIILYRAIMAIVVSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 42 0 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 518 SIILYRAIMAIVVSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 577 

Qy 421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 578 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 63 7 

Qy 481 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 5 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 63 8 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 697 

Qy 541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 698 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 75 7 

Qy 601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 660 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 758 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 817 

Qy 661 SSFAAAHNRTC 6 71 

I I I I I I I I I I I 
Db 818 SSFAAAHNRTC 82 8 



RESULT 5 
ABG15488 

ID ABG15488 standard; protein; 898 AA. 

XX 
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AC ABG15488; 
XX 

DT 18-FEB-2002 (first entry) 

XX 

DE Novel human diagnostic protein #15479. 

XX 

KW Human; chromosome mapping; gene mapping; gene therapy; forensic; 

KW food supplement; medical imaging; diagnostic; genetic disorder. 

XX 

OS Homo sapiens . 

XX 

PN WO200175067-A2 . 
XX 

PD ll-OCT-2001. 
XX 

PF 30-MAR-2001; 2001WO-US008631 . 
XX 

PR 31-MAR-2000; 2000US-00540217 . 

PR 23-AUG-2000; 2000US-00649167 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Drmanac RT, Liu C, Tang YT; 
XX 

DR WPI; 2001-639362/73. 

DR N-PSDB; AAS7 96 75. 

XX 

PT New isolated polynucleotide and encoded polypeptides, useful in 

PT diagnostics, forensics, gene mapping, identification of mutations 

PT responsible for genetic disorders or other traits and to assess 

PT biodiversity. 
XX 

PS Claim 20; SEQ ID NO 45847; 103pp; English. 
XX 

CC The invention relates to isolated polynucleotide (I) and polypeptide (II) 

CC sequences. (I) is useful as hybridisation probes, polymerase chain 

CC reaction (PGR) primers, oligomers, and for chromosome and gene mapping, 

CC and in recombinant production of (II). The polynucleotides are also used 

CC in diagnostics as expressed sequence tags for identifying expressed 

CC genes. (I) is useful in gene therapy techniques to restore normal 

CC activity of (II) or to treat disease states involving (II). (II) is 

CC useful for generating antibodies against it, detecting or quantitating a 

CC polypeptide in tissue, as molecular weight markers and as a food 

CC supplement. (II) and its binding partners are useful in medical imaging 

CC of sites expressing (II). (I) and (II) are useful for treating disorders 

CC involving aberrant protein expression or biological activity. The 

CC polypeptide and polynucleotide sequences have applications in 

CC diagnostics, forensics, gene mapping, identification of mutations 

CC responsible for genetic disorders or other traits to assess biodiversity 

http://es/ScoreAccessWeb/GetItem.action?AppId=10552...0-552-515-l_copy_157_933.rag&ItemType==4&startByte=0 (15 of 42)10/10/2008 8:50:59 AM 



SCORE Search Results Details for Application 10552515 and Search Result 20080624_135827_us-10-552-515-l_copy_157_933.rag. 

CC and to produce other types of data and products dependent on DNA and 

CC amino acid sequences. ABG00010-ABG30377 represent novel human diagnostic 

CC amino acid sequences of the invention. Note: The sequence data for this 

CC patent did not appear in the printed specification, but was obtained in 

CC electronic format directly from WIPO at 

CC ftp . wipo . int /pub/published_pct_sequences 

XX 

SQ Sequence 898 AA; 

Query Match 73.5%; Score 3031.5; DB 4; Length 898; 

Best Local Similarity 91.2%; Pred. No. 6.3e-316; 

Matches 578; Conservative 2; Mismatches 11; Indels 43; Gaps 2; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I : I : I I I I I I I I I 

Db 25 0 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQDYPTRPPTGRPACCAWLGIPNVL 309 

Qy 61 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 310 LEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 36 9 

Qy 121 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 370 GIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 42 9 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43 0 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 48 9 

Qy 241 CLDCPFWLLSSACALAQ AGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRW 2 96 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 490 CLDCPFWLLSSACALAQVREEAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRW 5 49 

Qy 297 DCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVW 356 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 55 0 DCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAVVV 6 09 

Qy 35 7 MCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLT 416 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 610 MCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLT 669 

Qy 417 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECA 4 76 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I 
Db 6 70 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECA 729 

Qy 477 AGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPW 536 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 730 AGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPW 789 
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Qy 



537 



EDDYELVPCEGLFDEYLEM VL 55 7 




Db 



790 



EDDYELVPCEGLFDEYLEMGAGFCPNACPELVPELTEPEKARDQPEARSAGQDSRPEAVL 8 4 9 



Qy 



558 



QFGFVT IFVAACPLAPLFALLNNWVEIRLDARKF 5 91 




Db 



850 



QFGFVT IFVAACPLAPLFALLNNWVE IRLDARKF 883 



RESULT 6 
ADB64420 

ID ADB64420 standard; protein; 920 AA. 
XX 

AC ADB6 4 42 0; 
XX 

DT 15-JUN-2007 (revised) 

DT 04-DEC-2003 (first entry) 

XX 

DE Human protein encoded by clone FEBRA20031280 . 
XX 

KW Human; pharmaceutical; diagnostic; gene therapy; tissue regeneration; 

KW cell regeneration; membrane protein; signal transduct ion-related protein; 

KW transcription-related protein; osteoporosis; neurological disease; 

KW cancer; tumour; BOND_PC; transmembrane protein 16D; 

KW transmembrane protein 16D (eight membrane-spanning domains); 

KW transmembrane protein 16D [Homo sapiens]; TMEM16D; FLJ34221; FLJ34272; 

KW FLJ35277; MGC130026; unnamed protein product; 

KW unnamed protein product [Homo sapiens] . 

XX 

OS Homo sapiens . 
XX 

PN EP1308459-A2 . 
XX 

PD 07-MAY-2003. 

XX 

PF 28-MAR-2002; 2002EP-00007401 . 
XX 

PR 05-NOV-2001; 2001JP-00379298 . 

PR 25-JAN-2002; 2002US-0350978P . 
XX 

PA (HELI-) HELIX RES INST. 

PA (REAS-) RES ASSOC BIOTECHNOLOGY. 

XX 

PI Isogai T, Sugiyama T, Otsuki T, Wakamatsu A, Sato H, Ishii S; 

PI Yamamoto J, Isono Y, Hio Y, Otsuka K, Nagai K, Irie R, Tamechika I; 

PI Seki N, Yoshikawa T, Otsuka M, Nagahari K, Masuho Y; 

XX 

DR WPI; 2003-450961/43. 

DR N-PSDB; ADB62450. 
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DR PC:NCBI; gi30520318. 
XX 

PT New polynucleotides and polypeptides, useful for developing a diagnostic 

FT marker or medicines for regulation of their expression and activity, or 

PT as targets of gene therapy. 

XX 

PS Claim 1; Page; 222pp; English. 

XX 

CC The invention discloses a polynucleotide comprising a sequence selected 

CC from 1970 fully defined nucleotide sequences which encode novel 

CC polypeptides. Also claimed is a polypeptide encoded by the polynucleotide 

CC or its partial peptide, an antibody binding to the polypeptide or peptide 

CC of the polynucleotide, immunologically assaying the polypeptide or 

CC peptide of the polynucleotide by contacting the polypeptide or peptide 

CC with the antibody of the encoded protein, and observing the binding 

CC between the two, a transformant carrying the polynucleotide in an 

CC expressible manner and an antisense polynucleotide. The oligonucleotide 

CC is useful as a primer for synthesising the polynucleotide, or as a probe 

CC for detecting the polynucleotide. The polynucleotides and encoded 

CC proteins are useful as pharmaceutical agents and many disease-related 

CC genes may be included in them, for developing a diagnostic marker or 

CC medicines for regulation of their expression and activity, or as targets 

CC of gene therapy. The genes are involved in tissue and/or cell 

CC regeneration. Membrane proteins, signal transduction-related proteins, 

CC transcription-related proteins, disease-related proteins and genes 

CC encoding them can be used as indicators for diseases (e.g. osteoporosis, 

CC neurological diseases, cancer, tumours. The cDNA may be used to regulate 

CC the activity or expression of the encoded protein to treat diseases. The 

CC sequence presented is a protein of the invention. Note: Some of the 

CC sequence data for this patent is not represented in the printed 

CC specification, but is based on sequence information supplied by the 

CC European Patent Office. 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 920 AA; 

Query Match 36.4%; Score 1502.5; DB 6; Length 920; 

Best Local Similarity 40.4%; Pred. No. 2.5e-151; 

Matches 328; Conservative 145; Mismatches 270; Indels 69; Gaps 20; 

Qy 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIP 5 7 

I : : : I I I I I I I i : : : : I : I I : I : I II 

Db 122 NSDIIFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRLRRWLPKK 181 

Qy 58 NVLL— EWPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGH 114 

: I I : I I : : I : I : : I : I : : I I I : I : I : II: I 

Db 182 PMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKETFFNNATRSRIVHHILQRIKY-E 23 9 
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Qy 115 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 174 

I I I : I : : : I I I I I I I I I : I : : : I I : I :: I I I I I I I I 

Db 240 EGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHRHLLYECWASWGVWYKYQ 299 

Qy 175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 23 4 

I I I I I I I I I I I : I I I I I I I : I I I I I I I : I I I I I : : : I : I : I 

Db 300 PLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVTTLDHSQVSKEVCQATDI 35 9 

Qy 235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 2 93 

I I I : I III I I : I I : I I I : I I I I I : : I II : II : II : I II : I : I 
Db 36 0 I-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIA 418 

Qy 29 4 YRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMV 352 

III I : I : I I I I I I I : I I I : I : I I I : I : : : I I : 

Db 419 YDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFTDKCSRLIVSASGIFFMI 478 

Qy 353 AVWMCLVSIILYRAIMAIWSRSGNTLLA-AWA SRIASLTGSW— NLVFILIL 40 4 

II: : I : : I I : : I I II I : : I : I I : I I I : : I 

Db 479 CWIAAVFGIVIYRWTV STFAAFKWALIRNNSQVAT-TGTAVCINFCIIMLL 530 

Qy 405 SKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 46 4 

: : I : I : II I I I : : : : I : : I I I I : I : I I I I I II I I I I I III I : I I I 
Db 531 NVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAY 590 

Qy 465 HTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : 
Db 591 LRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGYPLIQNWWTR RKVRQ 6 47 

Qy 524 KAGASAGASQGPWEDDYELVPCE— GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 

Db 648 EHGPERKISFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTTIFVAAFPLAPLLALLNNI 70 7 

Qy 582 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 6 41 

: II I I I I III : : I I I : I I I : I I I I I : II I : I : I I : I I I : : I : I I I : I I I 

Db 70 8 lEIRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITNAFVIAITSDFIPRLVYA 76 7 

Qy 6 42 W TRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFR 6 77 

: : I : : I : I : I I : : I I I I : I 

Db 76 8 YKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYR 82 7 

Qy 678 DDDGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYY 732 

I I : : I : : I I I I I I : I I I I I : I I : I : I : I I : I : : : : : I I I 

Db 828 DPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKY 88 7 

Qy 733 LAKQALAENEVLFGTNGTKDEQPKGSELSSHW 764 

I : : : I I : I : : I : I 

Db 888 LIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 
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RESULT 7 
ABP58666 

ID ABP58666 standard; protein; 920 AA. 

XX 

AC ABP5 866 6; 
XX 

DT 24-MAR-2003 (first entry) 

XX 

DE Human dihydropyrimidinase related protein 1-101.20. 

XX 

KW Human; dihydropyrimidinase related protein 1-101.20; 

KW recombinant production; gene therapy; psychosis; development disorder; 

KW uracil-related metabolic disorder; thymine-related metabolic disorder; 

KW pyrimidine metabolic disorder. 
XX 

OS Homo sapiens . 
XX 

FN CN1364894-A. 
XX 

PD 21-AUG-2002. 
XX 

PF lO-JAN-2001; 2001CN-00105195 . 
XX 

PR lO-JAN-2001; 2001CN-00105195 . 
XX 

PA (BIOW-) BIOWINDOW GENE DEV INC SHANGHAI. 

XX 

PI Mao Y, Xie Y; 

XX 

DR WPI; 2003-000532/01. 

DR N-PSDB; ABZ57080. 
XX 

PT New polypeptide-human dihydropyrimidinase relative protein 1-101, 20 and 

PT polynucleotide for encoding such polypeptide. 

XX 

PS Claim 1; Page 28-30 (Disclosure); 36pp; Chinese. 

XX 

CC The invention relates to human dihydropyrimidinase related protein 1- 

CC 101.20 (ABP58666) and nucleic acids encoding it (ABZ57080). The protein 

CC has a molecular weight of 101.2 kD . The invention also relates to a 

CC method for the recombinant production of the protein, an antagonist of 

CC the protein, and the use of the protein, gene and antagonist in 

CC therapeutic applications. Dihydropyrimidinase related protein 1-101.20 

CC can be used in the treatment of a variety of diseases such as psychosis, 

CC development disorders and uracil- and thymine-related metabolic 

CC disorders. The present sequence represents human dihydropyrimidinase 

CC related protein 1-101.20 

XX 
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SQ Sequence 92 0 AA; 

Query Match 36.0%; Score 1482.5; DB 6; Length 920; 

Best Local Similarity 40.0%; Pred. No. 3.6e-149; 

Matches 325; Conservative 146; Mismatches 272; Indels 69; Gaps 20; 

Qy 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIP 5 7 

I : : : I I I I I I I I : : : : I : I I : I : II 

Db 122 NSDIIFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRFRRWLPKK 181 

Qy 5 8 NVLL— EVVPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGH 114 

: I I : II : : I : I : : I : I : : I I I : I : I : II: I 
Db 182 PMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKETFFNNATRSRIVHHILQRIKY-E 23 9 

Qy 115 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 174 

I II : I : : : I I I I I I I I I : I : : : I I : I :: II II II II 

Db 240 EGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHRHLLYECWASWGVWYKYQ 29 9 

Qy 175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 23 4 

I I I I I I I II M : I I I I I I I : I I I I I I I : I I I I I : : : I : I : I 

Db 30 0 PLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVTTLDHSQVSKEVCQATDI 35 9 

Qy 235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 293 

111:1 III 11:1 I : I I I : I I I I I : : I I I : I I : I I : I I I : I : I 
Db 36 0 I-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIA 418 

Qy 294 YRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMV 352 

III I : I : I I I I I I I : I I I : I : I I I : I : : : I I : 

Db 419 YDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFTDKCSRLIVSASGIFFMI 4 78 

Qy 353 AVVVMCLVSIILYRAIMAIWSRSGNTLLA-AWA SRIASLTGSVV— NLVFILIL 404 

II: : I : : I I : : I I II I : : I : I I : I I I : : I 

Db 479 CVVIAAVFGIVIYRVVTV STFAAFKWALIRNNSQVAT-TGTAVCINFCI IMLL 530 

Qy 405 SKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 46 4 

: : I : I : I I I I I : : : : I : : I I I I : I : I I I I I II I I I I I III I : I I I 
Db 531 NVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAY 590 

Qy 465 HTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : 
Db 591 LRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGYPLIQNWWTR RKVRQ 647 

Qy 52 4 KAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: I I I I I I I I I I : I I I I I I : I I I I I I I I I I I I I I I I I I I I I 

Db 6 48 EHGPERKISFPQWEKDYNLQPMNAYGLYDEYLEMILQFGFTTIFVAAFPLAPLLALLNNI 70 7 

Qy 582 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 6 41 

: I I I I I I III : : I I I : I I I : I I I I : II I : I : I I : I I I : : I : I I I : I I I 

Db 708 lEIRLDAYKFVTQWRRPLASRAKDIGIGYGILEGIGILSVITNAFVIAITSDFIPRLVYA 76 7 
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Qy 642 W TRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFR 6 77 

: : I : : I : I : I I : : | | | | : | 

Db 76 8 YKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYR 82 7 

Qy 6 78 DDDGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYY 732 

I I : : I : : I I I I I I : I I I I I : I I : I : I : I I : I : : : : : I I I 

Db 828 DPPHSLVPYGYTLQFWHVLAARLAFIIVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKY 887 

Qy 733 LAKQALAENEVLFGTNGTKDEQPKGSELSSHW 764 

I : : : I I : I : : I : I 

Db 888 LIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 



RESULT 8 
ADK52114 

ID ADK52114 standard; protein; 981 AA. 

XX 

AC ADK52114; 
XX 

DT 15-JUN-2007 (revised) 

DT 20-MAY-2004 (first entry) 

XX 

DE Human atopic dermatitis/psoriasis-associated protein #29. 
XX 

KW Human; atopic dermatitis; psoriasis; dermatological; anti-inflammatory; 

KW antipsoriatic; rash; BOND_PC; transmembrane protein 16C; 

KW chromosome 11 open reading frame 25; 

KW transmembrane protein 16C [Homo sapiens]; TMEM16C; Cllorf25; GENX-3947; 

KW transmembrane protein 16C (eight membrane-spanning domains); 

KW hypothetical protein; hypothetical protein [Homo sapiens]; GO16020; 

KW GO16021; G04185; GO7001. 

XX 

OS Homo sapiens . 
XX 

PN WO2004016785-A1 . 
XX 

PD 26-FEB-2004. 
XX 

PF 06-AUG-2003; 2003WO-JP009999 . 
XX 

PR 06-AUG-2002; 2002 JP-00229319 . 

PR 14-MAY-2003; 2003 JP-00136544 . 
XX 

PA (GENO-) GENOX RES INC. 

PA (UYJU-) UNIV JUNTENDO. 
XX 

PI Itoh M, Ogawa K, Shinagawa A, Sudo H, Ogawa H, Ra C; 

PI Mitsuishi K; 
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XX 

DR WPI; 2004-214514/20. 

DR N-PSDB; ADK52028. 

DR PCiNCBI; gil3899227. 

DR PCiSWISSPROT; Q9BYT9 . 

XX 

PT Detecting atopic dermatitis or psoriasis comprises assaying levels of 

PT expression of an indicator gene at a rash site and non-rash site of a 

PT person with atopic dermatitis or psoriasis. 
XX 

PS Example 2; SEQ ID NO 147; 484pp; Japanese. 
XX 

CC The invention relates to detecting atopic dermatitis or psoriasis 

CC comprising assaying the levels of expression of an indicator gene at a 

CC rash site and non-rash site of a person with atopic dermatitis or 

CC psoriasis, comparing these levels with those of a healthy person, and 

CC determining that if the levels of indicators are higher or lower, then 

CC this indicates the disease. Also included are a reagent for detecting 

CC atopic dermatitis or psoriasis, a kit for screening for treatments, a 

CC transgenic non human vertebrate animal models for the diseases, an agent 

CC for inducing the diseases in mice and a DNA chip for assaying for the 

CC indicator genes. The method is used for treatment, detection and animal 

CC models for research of atopic dermatitis and psoriasis. The present 

CC sequence is a protein encoded by an indicator gene of the invention. 
CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 981 AA; 

Query Match 35.6%; Score 1467.5; DB 8; Length 981; 

Best Local Similarity 40.9%; Pred. No. 1.6e-147; 

Matches 313; Conservative 149; Mismatches 245; Indels 59; Gaps 21; 

Qy 20 WAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIPNVLLE — WPDV 6 7 

I I I I I I I : : : I : : : : : I : : : I : II: 

Db 214 WDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKDWMAQNPMVLDKSAFPDL 2 73 



Qy 



68 



-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GHEKKNLLGIHQ 12 4 



Db 



274 



EESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGISK VGIRK 32 9 



Qy 



125 



LLAEGVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 18 0 



Db 



330 



LINNGSYIAAFPPHEGAYKSSQPIKTHGPQ NNRHLLYERWARWGMWYKHQPLDLIR 3 85 



Qy 



181 



RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 4 0 



Db 



386 



LYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICKATEVF-MCPL 444 
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Qy 241 C-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCS 299 

I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I I I : : I I I I 

Db 445 CDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRRSILTYTWDLI 504 

Qy 30 0 DYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVWMC 35 8 

: : I : I I I I I I I I I I I I I : I I : I : I : I I I : : : I : 

Db 505 EWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGIFFMISLVITA 56 4 

Qy 359 LVSIILYR-AIMAIVVSRSGNTLLAAWASRIASLTGSV-VNLVFILILSKIYVSLAHVLT 416 

: : : : M : I I I : I : I : : | : | : | : : I : I : I : : I I 

Db 565 VFGVWYRLWMEQFASFKWNFIKQYW— QFATSAAAVCINFIIIMLLNLAYEKIAYLLT 622 

Qy 417 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFG-VRNEEC 4 75 

I I I : : : : I : : I I I : I : I M I I II I I I I I I I I I I : I I I : II I III 
Db 623 NLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNKLFDRWRLEEC 6 82 

Qy 476 AAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGP 535 

I I I I : I : : I I I I I : I I I : I : : I I : : : II II 

Db 683 HPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KRGIH-DASIPQ 736 

Qy 536 WEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVC 5 93 

I I : I : I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I III 

Db 73 7 WENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNIIEIRLDAYKFVT 796 

Qy 59 4 EYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW T 6 43 

: : I I I : II I I I I I II I : I I I I : I I I : : I : I I : : I I I : 
Db 79 7 QWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYKYGPCANHVEP 856 

Qy 644 RAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQTYWNLLAIRLAF 697 

: I : I : : I : I : I : I I I I : I : : I I :: I I I I I I 

Db 85 7 SENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQYWHILAARLAF 916 

Qy 698 VIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 743 

: I I I I I : I I : : I : I I : I : : : : : | | | | : : : | | : 
Db 917 IIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 962 

RESULT 9 
AEN06206 

ID AEN06206 standard; protein; 981 AA. 

XX 

AC AEN06 206; 
XX 

DT 15-JUN-2007 (revised) 

DT 22-FEB-2007 (first entry) 

XX 

DE Human esophageal cancer-associated protein SEQ ID NO 231. 

XX 
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KW diagnostic; metastasis; esophagus tumor; gastrointestinal disease; 

KW neoplasm; cytostatic; cancer; AXL; ZBTBll; TNFRSF14; NSUN5; SPEN; LTBP3; 

KW SYNGRl; SLC13A1; MAP3K12; GLYAT; ZNF659; B4GALT2; POGK; AQP3; CAPG; 

KW SLIT2; BOND_PC; transmembrane protein 16C; 

KW chromosome 11 open reading frame 25; 

KW transmembrane protein 16C [Homo sapiens]; TMEM16C; Cllorf25; GENX-3947; 

KW transmembrane protein 16C (eight membrane-spanning domains); 

KW hypothetical protein; hypothetical protein [Homo sapiens]; GO16020; 

KW GO16021; G04185; GO7001. 

XX 

OS Homo sapiens. 
XX 

PN WO2006118308-A1 . 

XX 

PD 09-NOV-2006. 
XX 

PF 02-MAY-2006; 2006WO-JP309177 . 
XX 

PR 02-MAY-2005; 2005 JP-00134530 . 

PR 13-SEP-2005; 2005JP-00265645 . 

PR 13-SEP-2005; 2005JP-00265678 . 
XX 

PA (TORA ) TORAY IND INC. 

PA (KYOU ) UNIV KYOTO. 
XX 

PI Alciyama H, Kozono S, Myomoto A, Nomura 0, Nobumasa H, Tana]<:a Y; 

PI Tomoda S, Shimada Y, Tsujimoto G; 

XX 

DR WPI; 2007-110304/11. 

DR PC:NCBI; gil3899227. 

DR PC:SWISSPR0T; Q9BYT9 . 
XX 

PT Composition for determining occurrence/metastasis of esophageal cancer in 

PT subject, comprises an antibody binding to a polypeptide encoded by a 

PT polynucleotide having a sequence of genes e.g. AXL, ZBTBll and TNFRSF14, 

PT and/or polynucleotides. 
XX 

PS Claim 1; SEQ ID NO 231; 142pp; Japanese. 
XX 

CC This invention describes a novel composition for detecting metastasis of 

CC esophageal cancer in a test subject. The composition contains a probe 

CC derived from polynucleotides AXL, ZBTBll, TNFRSF14, NSUN5, SPEN, LTBP3, 

CC SYNGRl, SLC13A1, MAP3K12, GLYAT, ZNF659, B4GALT2, POGK, AQP3, CAPG, 

CC SLIT2, their variants or fragments and an antibody. The invention also 

CC claims: a) a kit for detecting, determining or presuming the occurrence 

CC or metastasis of esophageal cancer in a test subject; b) a DNA chip for 

CC detecting, determining or presuming the occurrence or metastasis of 

CC esophageal cancer and c) a method to detect, determine or presume the 

CC occurrence or metastasis of esophageal cancer in a test subject by 
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CC detecting the presence of or amount or expression level of one or more 

CC esophagus-cancer related target nucleic acid in a biological sample. The 

CC method enables the rapid and convenient detection of occurrence or 

CC metastasis of esophageal cancer in test subject with high sensitivity. 

CC This sequence represents a protein used in the method of the invention 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 981 AA; 

Query Match 35.6%; Score 1467.5; DB 12; Length 981; 

Best Local Similarity 40.9%; Pred. No. 1.6e-147; 

Matches 313; Conservative 149; Mismatches 245; Indels 59; Gaps 21; 

Qy 2 0 WAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIPNVLLE — VVPDV 6 7 

I I I I I I I : : : I : : : : : I : : : I : II: 

Db 214 WDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKDWMAQNPMVLDKSAFPDL 2 73 

Qy 6 8 -PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GHEKKNLLGIHQ 12 4 

: I : I : : I : : I : I I I I : : I : I : : : I : I I I I : I I : 
Db 274 EESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGISK VGIRK 329 

Qy 125 LLAEGVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I : I I I I I I : I : I : III I I : I :: I I I I I I I : I I I I : I 

Db 330 LINNGSYIAAFPPHEGAYKSSQPIKTHGPQ NNRHLLYERWARWGMWYKHQPLDLIR 385 

Qy 181 RYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I : I I I I I I I : I I I I : I I I : I I II II:: : I I : I : : I I I I I 
Db 386 LYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICKATEVF-MCPL 444 

Qy 241 C-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCS 299 

I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I I I : : I I I I 

Db 445 CDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRRSILTYTWDLI 504 

Qy 30 0 DYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVWMC 35 8 

: : I : I I I I I I I I I M II : I I : I : I : I I I : : : I : 

Db 505 EWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGIFFMISLVITA 56 4 

Qy 359 LVSIILYR-AIMAIWSRSGNTLLAAWASRIASLTGSV-VNLVFILILSKIYVSLAHVLT 416 

: : : : I I : I I I : I : I : : I : I : I : : I : I : I : : I I 

Db 565 VFGVWYRLWMEQFASFKWNFIKQYW — QFATSAAAVCINFI I IMLLNLAYEKIAYLLT 6 22 

Qy 417 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFG-VRNEEC 4 75 

I II : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : II I : II I III 
Db 623 NLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNKLFDRWRLEEC 6 82 

Qy 476 AAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGP 535 

I I I I : I : : I I I I I : I I I : I : : I I : : : II II 
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Db 683 HPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KRGIH-DASIPQ 736 



Qy 536 WEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVC 5 93 

I I : I : I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I III 

Db 73 7 WENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNIIEIRLDAYKFVT 796 

Qy 594 EYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW T 6 43 

: : I I I : I I I I I I I III: I I I I : I I I : : I : I I : : I I I : 

Db 79 7 QWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYKYGPCANHVEP 856 

Qy 6 44 RAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQTYWNLLAIRLAF 697 

: I : I : : I : I : I : I I I I : I : : I I :: I I I I I I 

Db 85 7 SENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQYWHILAARLAF 916 

Qy 698 VIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 743 

: II I II : I I : : I : I I : I : : : : : | | | | : : : | | : 
Db 917 IIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 962 



RESULT 10 
AEG11146 

ID AEG11146 standard; protein; 840 AA. 

XX 

AC AEG11146; 
XX 

DT 15-JUN-2007 (revised) 

DT 20-APR-2006 (first entry) 

XX 

DE Human transmembrane protein 16A, SEQ ID NO: 11. 

XX 

KW Genetic marker; diagnostic; prognosis; gastrointestinal tumor; 

KW cytostatic; neoplasm; tumor marker; transmembrane protein 16A; BOND_PC; 

KW TMEM16A protein; TMEM16A protein [Homo sapiens] . 

XX 

OS Homo sapiens . 

XX 

PN US2006040292-A1 . 

XX 

PD 23-FEB-2006. 

XX 

PF 08-JUL-2005; 2005US-00177894 . 
XX 

PR 08-JUL-2004; 2004US-0586676P . 
XX 

PA (WEST/) WEST R B. 

PA (VRIJ/) VAN DE RUN M. 

XX 

PI West RB, Van De Rijn M; 
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DR WPI; 2006-182760/19. 

DR N-PSDB; AEG11141. 

DR GENBANK; AAH33036. 

DR PC:NCBI; gi34192278. 
XX 

FT Classifying tumor as gastrointestinal stromal tumor belonging to PDGFRA 

PT positive subclass, involves detecting expression or activity of gene 

PT encoding DOGl polypeptide in sample. 

XX 

PS Disclosure; SEQ ID NO 11; 177pp; English. 

XX 

CO The present invention relates to three gene markers such as DOGl, KIT and 

CO platelet derived-growth factor receptor alpha (PDGFRA) that are useful in 

CC classifying tumors. These gene markers are useful in the classification 

CC of gastrointestinal stromal tumors (GISTs) and tumors other than GISTs. 

CC The invention also relates to methods providing diagnostic, prognostic 

CC and predicative information based on the classifying step. The invention 

CC is useful for classifying gastrointestinal stromal tumors as belonging to 

CC a PDGFRA positive subclass, KIT negative or PDGFRA negative subclass. The 

CC present sequence is human transmembrane protein 16A (DOGl; TMEM16A) . The 

CC DOGl gene encodes a transmembrane protein of unknown function 

CC (transmembrane protein 16A) . The transmembrane protein 16A is encoded by 

CC DOGl gene that is mapped to llql3.2 on chromosome 11. 
CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 840 AA; 

Query Match 35.4%; Score 1461.5; DB 11; Length 840; 

Best Local Similarity 40.1%; Pred. No. 5.6e-147; 

Matches 331; Conservative 149; Mismatches 260; Indels 85; Gaps 21; 

Qy 5 QDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLL 61 

: I : I : I : : I I I I I I I I : I I : I : : : : I : I I I I : I I 

Db 2 8 RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMPTKKMYH — INETRGLLK — KINSVLQ 83 

Qy 6 2 EVVPDVPPEYYSCR FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTP 111 

: : : I : I I I I I I : I : I I I I I : : I I I : I 

Db 84 KITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTT 142 

Qy 112 YGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWN 171 

I : : I I I I I I I : I I : I I I I I : : I I : : I : : I I I : I : 

Db 143 CTKAKYS-MGITSLLANGVYAAAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFY 196 

Qy 172 KYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGS 231 

I I I I : I 11:11111: I I I I I I I II I : I I : : I I : I I I II : : I I : I : I 

Db 19 7 KYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGIIVFLYGCATMDENIPSMEMCDQ 256 
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Qy 232 KDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSA 290 

: : I I I I I I : I : I I I I I I : I III: I I I I I : I I I I I I : I : I I I I 
Db 25 7 RHNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQM 316 

Qy 291 TLAYRWDCSDYEDTEERPRPQFAA SAPMTAPNPITGEDEPYFPERSRARRMLAGS 345 

I M M : : I : I : I I : : I I : I I I : II I 

Db 317 RLNYRWDLTGFEEEEDHPRAEYEARVLEKSLKKESRNKET — DKVKLTWRDRFPAYLTNL 3 74 

Qy 346 WIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSVVNLVFILILS 405 

I I : I : I I : : : I : I I II : : : : : : : I : : I I I I : : I 

Db 375 VSIIFMIAVTFAIVLGVIIYRISMAAALAMNSSPSVRSNIRVTVTATAVIINLWIILLD 434 

Qy 406 KIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYH 46 5 

: : I : I I I : I : : I : I I : | | : : | | | | : I : I I I I I I I I I I I : I 

Db 435 EVYGCIARWLTKIEVPKTEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGDYV 49 4 

Qy 466 TLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSKKR 523 

: I 111111111:11 : I : II : I I I : I I I : I : I I I : I : : I : : 
Db 495 YIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPKMKKLIRYLKLKQQSP 554 

Qy 52 4 KAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVE 5 83 

: : I II I I II I I : I I : : I I I I I I : I I I : I I I I I I I I I I I : I 
Db 555 PDHEECVKRKQRYEVDYNLEPFAGLTPEYMEMIIQFGFVTLFVASFPLAPLFALLNNIIE 614 

Qy 584 IRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA— YYR 6 41 

11111:111 I I I I I I I I : I I I I I :: I I I : I I I I : I I : : : I : I I I : I I I 
Db 615 IRLDAKKFVTELRRPVAVRAKDIGIWYNILRGIGKLAVIIDAFVISFTSDFIPRLVYLYM 6 74 

Qy 642 WTRAHDLRGFLNFTLARAPSSF AAAHN RTCRYRAFRD DDGH 6 82 

: : : : I I : I I I III II : I I I : : I : : 

Db 6 75 YSKNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSENK 73 0 

Qy 6 83 Y— SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA 73 4 

I I : : I : I I I I I I I I I I : : : I : : | : : | | | | : : : : : | | 
Db 731 YDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVELFMR 790 

Qy 735 KQALAENEVLFGTNGTKDEQP KGSELSSH 763 

I I I I I I I I I I I I 

Db 791 EEQDKQQLL — ETCMEKERQKDEPPCNHHNTKACPDSLGSPAPSH 833 



RESULT 11 
AEG11142 

ID AEG11142 standard; protein; 960 AA. 

XX 

AC AEG11142; 

XX 

DT 15-JUN-2007 (revised) 

DT 20-APR-2006 (first entry) 
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XX 

DE Human transmembrane protein 16A, SEQ ID NO: 7. 

XX 

KW Genetic marker; diagnostic; prognosis; gastrointestinal tumor; 

KW cytostatic; neoplasm; tumor marker; transmembrane protein 16A; BOND_PC; 

KW transmembrane protein 16A; 

KW transmembrane protein 16A (eight membrane-spanning domains); 

KW oral cancer overexpressed 2; membrane protein; 

KW tumor amplified and overexpressed sequence 2; 

KW transmembrane protein 16A [Homo sapiens]; TMEM16A; TA0S2; 0RA0V2; 

KW FLJ10261. 

XX 

OS Homo sapiens . 

XX 

PN US2006040292-A1 . 
XX 

PD 23-FEB-2006. 

XX 

PF 08-JUL-2005; 2005US-00177894 . 
XX 

PR 08-JUL-2004; 2004US-0586676P . 
XX 

PA (WEST/) WEST R B. 

PA (VRIJ/) VAN DE RUN M. 

XX 

PI West RB, Van De Rijn M; 

XX 

DR WPI; 2006-182760/19. 

DR N-PSDB; AEG11136. 

DR REFSEQ; NP_060513. 

DR PC:NCBI; gi40354210. 

XX 

PT Classifying tumor as gastrointestinal stromal tumor belonging to PDGFRA 

PT positive subclass, involves detecting expression or activity of gene 

PT encoding DOGl polypeptide in sample. 

XX 

PS Disclosure; SEQ ID NO 7; 177pp; English. 

XX 

CC The present invention relates to three gene markers such as DOGl, KIT and 

CC platelet derived-growth factor receptor alpha (PDGFRA) that are useful in 

CC classifying tumors. These gene markers are useful in the classification 

CC of gastrointestinal stromal tumors (GISTs) and tumors other than GISTs. 

CC The invention also relates to methods providing diagnostic, prognostic 

CC and predicative information based on the classifying step. The invention 

CC is useful for classifying gastrointestinal stromal tumors as belonging to 

CC a PDGFRA positive subclass, KIT negative or PDGFRA negative subclass. The 

CC present sequence is human transmembrane protein 16A (DOGl; TMEM16A) . The 

CC DOGl gene encodes a transmembrane protein of unknown function 

CC (transmembrane protein 16A) . The transmembrane protein 16A is encoded by 
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CC DOGl gene that is mapped to llql3.2 on chromosome 11. 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 960 AA; 

Query Match 35.3%; Score 1456.5; DB 11; Length 96 0; 

Best Local Similarity 40.2%; Pred. No. 2.4e-146; 

Matches 333; Conservative 147; Mismatches 260; Indels 89; Gaps 22; 

Qy 5 QDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLL 61 

: I : I : I : : I I I I I I I I : I I : I : : : : I : I I I I : I I 

Db 144 RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMPTKKMYH — INETRGLLK — KINSVLQ 199 

Qy 6 2 EVVPDVPPEYYSCR FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTP 111 

: : : I : I I I I I I : I : I I I I I : : I I I : I 

Db 200 KITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTT 258 

Qy 112 YGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWN 171 

I : : I I 11111:11:11111: : I I : : I : : I I I : I : 

Db 259 CTKAKYS-MGITSLLANGVYAAAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFY 312 

Qy 172 KYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGS 231 

I I I I : I 11:11111: I I I I I I I II I : I I : : I I : I I I II : : I I : I : I 

Db 313 KYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGIIVFLYGCATMDENIPSMEMCDQ 3 72 

Qy 232 KDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSA 290 

: : I I I I I I : I : I I I I I I : I III: I I I I I : I I I I I I : I : I I I I 
Db 373 RHNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQM 432 

Qy 291 TLAYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPITGEDEPYFPERSRARRM 341 

I II M : : I : I I I I : : I I : I I I : || 

Db 433 RLNYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLKKESRNKET— DKVKLTWRDRFPAY 49 0 

Qy 342 LAGSVVIVVMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLVFI 401 

I !!:!:!! : : : I : I I II : : : : : : : I ::!!!! 

Db 491 LTNLVSIIFMIAVTFAIVLGVIIYRISMAAALAMNSSPSVRSNIRVTVTATAVIINLVVI 55 0 

Qy 402 LILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYP 461 

: : I : : I : I I I : I : : I : I I : | | : : | | | | : I : I I I I I I I I I I 

Db 551 ILLDEVYGCIARWLTKIEVPKTEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVGRP 610 

Qy 462 GNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLR 519 

1:1 : I 111111111:11 : I : I I : I I I : I II : I : I II : I : : I : 
Db 611 GDYVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPKMKKLIRYLKLK 6 70 

Qy 520 SKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLN 5 79 

: : : I I I I I II I I : I I :: I I I I I I : I I I : I I I I I I I I I I 
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Db 6 71 QQSPPDHEECVKRKQRYEVDYNLEPFAGLTPEYMEMIIQFGFVTLFVASFPLAPLFALLN 73 0 



Qy 



580 NWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRA- 63 8 




Db 



731 NIIEIRLDAKKFVTELRRPVAVRAKDIGIWYNILRGIGKLAVIINAFVISFTSDFIPRLV 79 0 



Qy 



639 -YYRWTRAHDLRGFLNFTLARAPSSF- 



-AAAHN RTCRYRAFRD 6 78 



Db 



791 YLYMYSKNGTMHGFVNHTL : 



-SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPW 8 46 



Qy 



6 79 DDGHY— SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA— 73 4 



Db 



84 7 SENKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVE 9 06 



Qy 



735 KQALAENEVLFGTNGTKDEQP KGSELSSH 763 



Db 



90 7 LFMREEQDKQQLL — ETWMEKERQKDEPPCNHHNTKACPDSLGSPAPSH 953 



RESULT 12 
AFB77190 

ID AFB77190 standard; protein; 1017 AA. 
XX 

AC AFB77190; 
XX 

DT 28-JUN-2007 (first entry) 
XX 

DE Mouse TM-1 (Tmeml6a) protein, 
XX 

KW Cell isolation; stem cell; therapeutic; transgenic animal; screening; 

KW tissue regeneration; genitourinary disease; uropathic; 

KW intervertebral disk displacement; degeneration; injury; vulnerary; 

KW back pain; transmembrane factor-1; Tmeml6a. 

XX 

OS Mus musculus. 

XX 

PN WO2007027583-A2 . 
XX 

PD 08-MAR-2007. 

XX 

PF 28-AUG-2006; 2006WO-US033491 . 
XX 

PR 31-AUG-2005; 2005US-0713400P . 
XX 

PA (UYFL ) UNIV FLORIDA RES FOUND INC. 
XX 

PI Harfe BD, Cohn MJ; 
XX 

DR WPI; 2007-412931/39. 
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DR N-PSDB; AFB77189. 
XX 

PT Isolating sonic hedgehog expressing-cells comprises obtaining a non-human 

FT transgenic subject in which a marker gene has been inserted into the 

PT subject's genome. 

XX 

PS Disclosure; SEQ ID NO 2; 96pp; English. 

XX 

CC The present invention relates to a method of isolating cells in selected 

CC tissues co-expressing the sonic hedgehog (Shh) gene and a marker gene. 

CC The method involves obtaining a non-human transgenic subject in which a 

CC marker gene has been inserted into the subject's genome and isolating 

CC Shh/marker gene expressing cells and Shh/marker gene non-expressing cells 

CC from the selected tissue. The invention further provides a method of 

CC identifying differentially expressed genes (e.g. transmembrane factors TM 

CC -1 and TM-2, EST 1437418, Mmu-miR-135a-2 and AP-2 beta) in selected 

CC tissues co-expressing the sonic hedgehog gene and a marker gene. The 

CC invention is useful in tissue engineering, regeneration, reconstruction 

CC and/or repair of tissues and genitourinary system and also in treating 

CC intervertebral disk rupture, degeneration, disease or injury and back 

CC pain. The invention is further useful for generating transgenic animal. 

CC The present sequence is the mouse TM-1 (Tmeml6a) protein. 

XX 

SQ Sequence 1017 AA; 

Query Match 35.2%; Score 1452.5; DB 12; Length 1017; 

Best Local Similarity 40.0%; Pred. No. 7e-146; 

Matches 330; Conservative 156; Mismatches 257; Indels 81; Gaps 22; 

Qy 6 DGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLE 6 2 

I : I : I : : I I I I I I I I : I I : I : : : : : : I I I I : I I : 

Db 202 DEDTKIHGVGFVKIHAPWHVLCREAEFLKLKMPTKKVYHISE — TRGLLK — TINSVLQK 257 

Qy 63 WPDVPPEYYSCRFRVNKLPRFLGS DNQDTFFTSTKRHQILFEILAKTPYG 113 

: : I : I : I : I : : I : I I I I I : : I I I : 1 

Db 25 8 ITDPIQPKVAEHRPQTTKRLSYPFSREKQHLFDLTDRDSFFDSKTRSTIVYEILKRTTCT 317 

Qy 114 HEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKY 173 

I : : I I 11111111:11111: II | | : : | : : | | : | : | | 

Db 318 KAKYS-MGITSLLANGVYSAAYPLHDGDY EGDNV-EFNDRKLLYEEWASYGVFYKY 3 71 

Qy 174 QPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKD 233 

I I : I 11:111111 I I I I I I I II I : I I : : I I : I I I II I : I I : I : I : 
Db 372 QPIDLVRKYFGEKVGLYFAWLGAYTQMLIPASIVGVIVFLYGCATVDENIPSMEMCDQRY 431 

Qy 234 SFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATL 292 

: I I I I I I : I : I I I I I I : I III: I I I I I : I I I I I I : I : I I I I I 
Db 432 NITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQMRL 491 
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Qy 293 AYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPITGEDEPYFPERSRARRMLA 343 

I I I I : : I : I I I I : : I I : I I I : II 

Db 492 NYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLRKESRNKET — DKVKLTWRDRFPAYFT 549 

Qy 344 GSVVIVVMVAVVVMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSVVNLVFILI 403 

I I : I : I I : : : I : I I I : : : : : : : I : : I I I I : : 

Db 550 NLVSIIFMIAVTFAIVLGVIIYRISTAAALAMNSSPSVRSNIRVTVTATAVIINLWIIL 609 

Qy 404 LSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGN 463 

I : : I : I I I : I : : i : 11: | | | : : | | | | : I : I I I I I I I I I I I : 
Db 610 LDEVYGCIARWLTKIEVPKTEKSFEERLTFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGD 66 9 

Qy 46 4 YHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSK 521 

I : I 111111111:11 : I : I I : I I I : I I I : I : I I I : I : : : I I : 
Db 6 70 YVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPKMKKFIRYLKLRRQ 72 9 

Qy 522 KRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: : I I : I I II I I : || : : II II II : I II : I I I I I I I I I I I 
Db 730 SPSDREEYVKRKQRYEVDFNLEPFAGLTPEYMEMIIQFGFVTLFVASFPLAPLFALLNNI 789 

Qy 582 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 6 41 

: I I I I I I : I I 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 :: I I I : I I I I I I I : : : I : I I I : I I I 
Db 790 lEIRLDAKKFVTELRRPVAIRAKDIGIWYNILRGVGKLAVIINAFVISFTSDFIPRLVYL 849 

Qy 642 WTRAHD— LRGFLNFTLARAPSSF AAAHN RTCRYRAFRD DD 680 

: : : : I I : I I I III II : I I I : : I : : 

Db 850 YMYSQNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSE 9 05 

Qy 681 GHY — SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYL 733 

I I : : I : I I I I I I I I I I : : : I : : | : : | | | | : : : : : | | 

Db 906 HKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIPKDISQQIHKEKVLMVELF 965 

Qy 73 4 AKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKA 7 72 

III: : : I : I : : : I : I I I : I 

Db 966 MREEQGKQQLLDTWM EKEKPRDVPCNNH-SPTTHPEA 1001 



RESULT 13 
ADG48280 

ID ADG48280 standard; protein; 1003 AA. 
XX 

AC ADG48280; 

XX 

DT ll-MAR-2004 (first entry) 

XX 

DE Human retina-specific protein - C12orf Svariants . 
XX 

KW human; retina-specific protein; NETOl; retinal disease; 

KW age related macular degeneration; night blindness; C12orf 3variants . 
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XX 

OS Homo sapiens . 
XX 

PN WO2003068967-A2 . 
XX 

PD 21-AUG-2003. 
XX 

PF 18-FEB-2003; 2003WO-EP001625 . 

XX 

PR 18-FEB-2002; 2002EP-000036 75 . 

PR 21-FEB-2002; 2002US-0357857P . 
XX 

PA (LYNK-) LYNKEUS BIO TECH GMBH. 

XX 

PI Stoehr BH, Weber FHB, Goehring F; 
XX 

DR WPI; 2003-767334/72. 

DR N-PSDB; ADG48279. 
XX 

PT New nucleic acid encoding retinal protein sNETOl, useful for diagnosis of 

PT retinal disease, especially macular degeneration, also for drug screening 

PT and therapy. 
XX 

PS Claim 18; Fig 14; 199pp; English. 

XX 

CC The invention comprises the amino acid and coding sequences of a human 

CC retina-specific protein - NETOl . The DNA and protein sequences of the 

CC invention are useful in the treatment of retinal diseases, such as 

CC macular degeneration (especially age related) and night blindness. The 

CC present amino acid sequence represents the human retina-specific protein 

CC C12orf 3variants . 

XX 

SQ Sequence 1003 AA; 

Query Match 34.9%; Score 1437; DB 7; Length 1003; 
Best Local Similarity 38.7%; Pred. No. 3.2e-144; 

Matches 326; Conservative 161; Mismatches 263; Indels 92; Gaps 23; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQ ASNWSAGLLAW 53 

: : I : : : : : : I I I I I I I : : I : I : : : I : I I I 

Db 174 EKDLENKSQGSIFVRIHAPWQVLAREAEFLKIKVPTKKEMYEIKAGGSIAKKFSAAL 230 

Qy 5 4 LGIPNVLLEWPDVPPEYYSCRFRVNKLP RFLGSDNQDTFFTSTKRHQILFEIL 10 7 

: : I I I I : : : : I : I I I I : I : I : I I I 

Db 231 QKLSSHLQPRV-PEHSNNKMKNLSYPFSREKMYLYNIQEKDTFFDNATRSRIVHEIL 286 

Qy 108 AKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARW 16 7 
: I I : I I : I : I : | | : | | | | | : : | : : | | : : | : | | | | : 

Db 287 KRTACS-RANNTMGINSLIANNIYEAAYPLHDGEYDSPEDD MNDRKLLYQEWARY 340 
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Qy 16 8 GKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQE 22 7 

I : I : I I : I : I : I I I I I : I I I I I I I II : I : I : : I : I : I I I I I : I I I : : I 

Db 341 GVFYKFQPIDLIRKYFGEKIGLYFAWLGLYTSFLIPSSVIGVIVFLYGCATIEEDIPSRE 400 

Qy 228 LCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWK 286 

: I : : : I I I I I I I : I I I I I I III III: I I I I I : I I I I I I : I I I I 
Db 401 MCDQQNAFTMCPLCDKSCDYWNLSSACGTAQASHLFDNPATVFFSIFMALWATMFLENWK 46 0 

Qy 28 7 RKSATLAYRWDCSDYEDTEER PRPQFAA SAPMTAPNPIT G 326 

I I I I I : I : I I I II:: II I 

Db 461 RLQMRLGYFWDLTGIEEEEERAQEHSRPEYETKVREKMLKESNQSAVQKLETNTTECGDE 52 0 

Qy 327 EDEPYFPERSRARRMLAGSWIWMVAVWMCLVSIILYRAIMAIVVSRSGNTLLAAWAS 386 

: I I : I I I : I : I : : : I : I I I : I III 
Db 521 DDEDKLTWKDRFPGYLMNFASILFMIALTFSIVFGVIVYRITTAAALS LNKATRS 5 75 

Qy 38 7 RI ASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNF 443 

: : I : : I I I I I I I : I I : : I I I : I : : I : I I : I I I :: I I I 

Db 576 NVRVTVTATAVIINLWILILDEIYGAVAKWLTKIEVPKTEQTFEERLILKAFLLKFVNA 635 

Qy 444 YSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNM 501 

II 1:111111111 I I : I : I I I I I I I I I I I : II : I : I I : I I I : I II: 

Db 636 YSPIFYVAFFKGRFVGRPGSYVYVFDGYRMEECAPGGCLMELCIQLSIIMLGKQLIQNNI 6 95 

Qy 502 QEVLIPKLKGWWQKFRLRSKKRPCAGASAGA-SQGP — WEDDYELVPCEGLFDEYLEMVLQ 55 8 

I : : I I I I : I I : : M : I I : I I : I I I I II I I : I I : : I 

Db 696 FEIGVPKLK KLFRKLKDETEAGETDSAHSKHPEQWDLDYSLEPYTGLTPEYMEMI IQ 752 

Qy 559 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTH 618 

11111:111: 1111:111111 : I : I I I I : I I I I III I I : I I I I I I I I : I : 
Db 753 FGFVTLFVASFPLAPVFALLNNVIEVRLDAKKFVTELRRPDAVRTKDIGIWFDILSGIGK 812 

Qy 619 LAVISNAFLLAFSSDFLPRAYYRWTRAHD— LRGFLNFTLA RAPSSFAA 665 

: I I I I I I : : I : I I I : I I I : : : : I : I I I : I I I : : : I 

Db 813 FSVISNAFVIAITSDFIPRLVYQYSYSHNGTLHGFVNHTLSFFNVSQLKEGTQPENSQFD 8 72 

Qy 666 AHNRTCRYRAFRD DDGHYSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIP 720 

: I I : : : I : : : I : I I : I : I I I I I I : I : : : I : | : | : : | | | | 

Db 873 QEVQFCRFKDYREPPWAPNPYEFSKQYWFILSARLAFVIIFQNLVMFLSVLVDWMIPDIP 932 

Qy 721 ESVEIKVKRE YYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKA-S 773 

: : : I : I : : I : I : I I : I I : I : I I 

Db 933 TDISDQIKKEKSLLVDFFLKE EHEKLKLMDEPALRSPGGGDRSRSRAASSAPSGQS 988 

Qy 774 QL 775 

I I 

Db 989 QL 990 
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RESULT 14 
AEH82071 

ID AEH82071 standard; protein; 913 AA. 

XX 

AC AEH82071; 
XX 

DT 15-JUN-2007 (revised) 

DT 13-JUL-2006 (first entry) 
XX 

DE Human gnathodiaphyseal dysplasia protein, GDDl . 
XX 

KW Osteopathic; Gene therapy; bone disease; bone injury; bone resorption; 

KW gnathodiaphyseal dysplasia; GDDl; BOND_PC; transmembrane protein 16E; 

KW integral membrane protein GDDl; transmembrane protein 16E [Homo sapiens]; 

KW TMEM16E; GDDl; integral membrane protein GDDl [Homo sapiens]; G05783; 

KW GO16020; GO16021. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Inhibitory-site 356 

FT /note= "Missense mutations in the coding sequence can 

FT lead to substitution of this residue with either Arg or 

FT Gly" 
XX 

PN JP2006121961-A. 
XX 

PD 18-MAY-2006. 

XX 

PF 28-OCT-2004; 2004JP-00313511 . 
XX 

PR 28-OCT-2004; 2004JP-00313511 . 
XX 

PA (UYTO-) UNIV TOKUSHIMA NAT UNIV CORP. 

XX 

PI Itakura M, Tsutsumi S, Kamata N, Inoue H; 

XX 

DR WPI; 2006-367194/38. 

DR N-PSDB; AEH82070. 

DR PCiNCBI; gi47106048. 

DR PCrSWISSPROT; Q75V66 . 

XX 

PT Novel gnathodiaphseal dysplasia DNA, useful as diagnostic agent for bone 

PT disease such as gnathodiaphseal dysplasia, bone deficiency or bone- 

PT resorption property disease. 
XX 

PS Claim 9; SEQ ID NO 2; llpp; Japanese. 

XX 
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CC The present invention relates to a human gnathodiaphyseal dysplasia (GDD) 

CC coding sequence (GDDl; AEH82070) and encoded protein (AEH82071) . GDDl is 

CC useful as a bone disease diagnostic agent, where the bone disease is GDD, 

CC bone deficiency and/or bone-resorption property disease, where the GDD 

CC disease causes hardening of bone, susceptibility to fracture, cement bone 

CC pathology of a lower jaw bone, etc. GDDl is also useful in bone formation 

CC regeneration, hard tissue reconstruction, etc., and in research 

CC application. 
CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 913 AA; 

Query Match 34.3%; Score 1412.5; DB 11; Length 913; 

Best Local Similarity 39.2%; Pred. No. 1.2e-141; 

Matches 309; Conservative 148; Mismatches 259; Indels 73; Gaps 20; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQE — LPNQASNWSAGLLAWLGIPN 5 8 

: : I : I I I : : : I I I I I I I I : I : I : : I : I : : I : : I 

Db 120 KRDSEDGRT— YFVKIHAPWEVLVTYAEVLGIKMPIKESDIPRPKHTPISYVLGPVRLP- 176 

Qy 5 9 VLLEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEK-K 117 

I I I I I : : : I : : II I I I I I I : I : : I : : I I : : I : I I I 

Db 177 -LSVKYPH— PEYFTAQFSRHRQELFLIED-QATFFPSSSRNRIVYYILSRCPFGIEDGK 232 

Qy 118 NLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLD 177 

11:11 I : I : I I I I I : III I : I I I : I II : : I I I I I 

Db 233 KRFGIERLLNSNTYSSAYPLHDGQYWKPSEPPNP — TNERYTLHQNWARFSYFYKEQPLD 29 0 

Qy 178 HVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSK— DSF 235 

: : I : I I I : : I I : I I I I I I I I I I I I : I : : : I : I : 
Db 291 LIKNYYGEKIGIYFVFLGFYTEMLFFAAWGLACFIYGLLSMEHNTSSTEICDPEIGGQM 350 

Qy 236 EMCPLCLD-CPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAY 29 4 

I I I I I I : I I : I I : : I I I : | | | | : : | | : | | | | : | | : : | | | 

Db 351 IMCPLCDQVCDYWRLNSTCLASKFSHLFDNESTVFFAIFMGIWVTLFLEFWKQRQARLEY 410 

Qy 295 RWDCSDYEDTEE — RPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMV 352 

II I : I : : : : I I : I I I : I I I I I I : I : : : 

Db 411 EWDLVDFEEEQQQLQLRPEFEAMCKHRKLNAVTKEMEPYMPLYTRIPWYFLSGATVTLWM 4 70 

Qy 353 AVVVMCLVSIILYRAIMAIWSRSGNTLLAAWASRI ASLTGS 394 

: : I I : I : : I : I I : : I : I I : I I I I I 

Db 471 SLVVTSMVAVIVYRL SVFATFASFMESDASLKQVKSFLTPQITTSLTGS 519 

Qy 395 WNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFK 45 4 

: I : I I I I : I : : : I : I : I I : : I : I I I : I : I I I I I I I I I I : I I I I 

Db 520 CLNFIVILILNFFYEKISAWITKMEIPRTYQEYESSLTLKMFLFQFVNFYSSCFYVAFFK 5 79 
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Qy 455 GRFVGYPGNYHTLFGV-RNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWW 513 

1:1111111 II I : I I I I I I I I I I : I : I I I I I : I : : I : I II 

Db 58 0 GKFVGYPGKYTYLFNEWRSEECDPGGCLIELTTQLTIIMTGKQIFGNIKEAIYPLALNWW 63 9 

Qy 514 QKFRLRSKKRKAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPL 5 71 

: : I I I : : I I I : : I I I I I I I I I I I I I I I : I I I : I I 

Db 640 RRRKARTNSEKLYSRWEQDHDLESFGPLGLFYEYLETVTQFGFVTLFVASFPL 6 92 

Qy 572 APLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFS 631 

M I II : M I I I I : I I I : I I I I I : I I I : I III: I : I : I || : : II : 
Db 693 APLLALINNIVEIRVDAWKLTTQYRRTVASKAHSIGVWQDILYGMAVLSVATNAFIVAFT 752 

Qy 632 SDFLPRAYYRW TRAHDLRGFLN FTLARAPSSFAAAHNR TCRYRAFR — 6 77 

I I : I I I : : I : : I I : I I : I : I I I I I I : I 

Db 753 SDIIPRLVYYYAYSTNATQPMTGYVNNSLSVFLIADFPNHTAPSEKRDFITCRYRDYRYP 812 

Qy 6 78 -DDDGHY— SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA 73 4 

II: I : : I : : I I : : I : I I I I I I I I II : : I I : I : I : : I I I : 
Db 813 PDDENKYFHNMQFWHVLAAKMTFIIVMEHWFLVKFLLAWMIPDVPKDWERIKREKLMT 8 72 

Qy 735 KQALAENEV 743 

: I : I : 
Db 873 IKILHDFEL 881 



RESULT 15 
ABB62812 

ID ABB62812 standard; protein; 1219 AA. 
XX 

AC ABB62812; 

XX 

DT 15-JUN-2007 (revised) 

DT 26-MAR-2002 (first entry) 

XX 

DE Drosophila melanogaster polypeptide SEQ ID NO 15228. 

XX 

KW Drosophila; developmental biology; cell signalling; insecticide; 

KW pharmaceutical; BOND_PC; CG6938-PA; CG6938-PA [Drosophila melanogaster] ; 

KW CG6938. 

XX 

OS Drosophila melanogaster. 

XX 

PN WO200171042-A2 . 

XX 

PD 27-SEP-2001. 

XX 

PF 23-MAR-2001; 2001WO-US009231 . 
XX 
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PR 23-MAR-2000; 2000US-0191637P . 

PR ll-JUL-2000; 2000US-00614150 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Venter JC, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2001-656860/75. 

DR N-PSDB; ABL06915. 

DR PC:NCBI; gi24663059. 

XX 

PT New isolated nucleic acid detection reagent for detecting 1000 or more 

PT genes from Drosophila and for elucidating cell signaling and cell-cell 

PT interactions . 
XX 

PS Disclosure; SEQ ID NO 15228; 21pp + Sequence Listing; English. 
XX 

CO The invention relates to an isolated nucleic acid detection reagent 

CO capable of detecting 1000 or more genes from Drosophila. The invention is 

CO useful in developmental biology and in elucidating cell signalling and 

CC cell-cell interactions in higher eukaryotes for the development of 

CC insecticides, therapeutics and pharmaceutical drugs. The invention 

CC discloses genomic DNA sequences (ABL16176-ABL30511 ) , expressed DNA 

CC sequences (ABL01840-ABL16175 ) and the encoded proteins (ABB57737- 

CC ABB72072) . The sequence data for this patent did not form part of the 

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 

CC 

CC Revised record issued on 15-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 1219 AA; 

Query Match 33.4%; Score 1378.5; DB 4; Length 1219; 

Best Local Similarity 37.8%; Pred. No. 8.5e-138; 

Matches 304; Conservative 146; Mismatches 284; Indels 71; Gaps 19; 

Qy 5 QDGNTTVHYALLSASWAVLCYYAEDLRLKLP LQELPNQASN 45 

II : : I : I I : I I I I I : : 

Db 384 QSFNEKTFFLKIHLPWRLETRLAEVMNLKLPVKRFITISVKPSWDEENWLRNMQYWKDV 443 

Qy 46 WSAGLLAWLGIPNVLLEWPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFE 105 

I I : : III : : I : I : I : I I II : I : : : 

Db 444 WQR-LTKKIQLDQTLLE GETTFKAATANGNPEEQFIVKD-RATAFTSAQRSLMVMQ 49 7 

Qy 106 ILAKTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLN-QRQVLFQHW 16 4 

: I : I I : : : I I : I : : I I I I I : I : : I : : : I : I I : I I 

Db 498 VLIRTPFDESDRS — GIRRLMNDGTYLGCFPLHEGRY DRPHSSGISLDRRVLYQTW 551 
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Qy 16 5 ARWGKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSD — 222 

I : I I I I I I I : I I I : I : I I I I I I I I I I I : I I I I I I I : I : I : 
Db 552 AHPSQWYKKQPLCLVRKYFGDKIALYFCWLGFYTEMLVYPAWGTLCFIYGLATLESEDN 611 

Qy 223 IPTQELCG — SKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAV 2 79 

I : : I : I : : I I I I I : I I : I : : III: I I I I : : I I : I I 

Db 612 TPSKEICNEYGTGNITLCPLCDKACSYQRLSESCLFSRLTYLFDNPSTVFFAIFMSFWAT 6 71 

Qy 280 LLLEYWKRKSATLAYRWDCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRAR 339 

111111:1:11 : I : I I I : I : I I I : I I I I I : I : 

Db 672 TFLELWKRKQSVLVWEWDLHNV-DMDEENRPEFETNATTFRMNPVTREKEPYMSTWNRSI 730 

Qy 340 RMLAGSWIWMVAVWMCLVSIILYRAIMAIWSRSGNTLLAAWASRIASLTGSWNLV 399 

I : ::!::!!: : : jj jj : I : I : I j : j : : : j jj 

Db 731 RFVITGSAVLFMISWLSAVLGTILYRITLVSVIYGGGGFFVKEHAKLFTSVTAALINLV 790 

Qy 400 FILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVG 459 

I : I I : : I I : I M I II I : : I I : : I I : I I : I : I I I I I : I I I I I I I I I 
Db 791 VIMILTRIYHRMAIKLTNLENPRTHTEYEDSYTFKIFFFEFMNFYSSLIYIAFFKGRFFD 850 

Qy 460 YPGNYHT LFGVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQK 515 

III: I : : I : I I I I I I : I : I I I I I I II I I I I I I : 

Db 851 YPGDDQARKSEFFRLKNDICDPAGCLSELCIQLAIIMVGKQCWNNFMEYLFPKFWNWWR- 909 

Qy 516 FRLRSKKRKAGASAGASQGPWEDDYELV-PCE-GLFDEYLEMVLQFGFVTIFVAACPLAP 5 73 

: I : : I Mil: I I I I I I I I I : I I : I I I I : I I I I I I I I 

Db 910 QRKHKQATKDESHLHMAWEQDYHMQDPGRLALFDEYLEMILQYGFVTLFVAAFPLAP 966 

Qy 574 LFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSD 633 

I I I I I I I I I I I I I I I : 111:111 : I I I I : II : I : I I : I I I I : : I : : I I 

Db 96 7 LFALLNNVAEIRLDAYKMVTQARRPLAERVEDIGAWYGILRIITYTAWSNAFVIAYTSD 1026 

Qy 63 4 FLPRAYYR— WTRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFRDDDGH 6 82 

1:11 I : : : III:: : I : : I : I I : I I : I : 

Db 1027 FIPRMVYKFVYSETHTLAGYIEHSLSIFNTSDYKEEWGASVSEKDPDTCQYRGYRNGPKD 1086 

Qy 683 Y SQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQA 73 7 

I I I I :: I I I I I I : I I I I I I I : : : : : I I : I I : : : : I I I I I : I 

Db 1087 YEPYGLSPHYWHVFAARLAFVWFEHWFVITGIMQFIIPDVPSEVKTQMQREQLLAKEA 1146 

Qy 73 8 LAENEVLFGTNGTKDEQPKGSELSS 762 

: : I I I : : I 

Db 1147 KYQ HGIKRAQGDSQDIMS 1164 
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